Summary statistics of PMLB datasets

2020-07-30

The plot below is created with plotly. Each dot represents a dataset colored based on its associated task (classification vs. regression). In log scale, the x and y axis shows the number of observations and features respectively. Please click on the legend to hide/show the groups of datasets. Click on each dot to access the dataset’s pandas-profiling report.

Browse, sort and search the complete table of summary statistics below: